NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Diversifying behaviors for learning in asymmetric multiagent systems

https://doi.org/10.1145/3512290.3528860

Dixit, Gaurav; Gonzalez, Everardo; Tumer, Kagan (July 2022, Genetic and Evolutionary Computation Conference)

Full Text Available
Heterogeneous Agent Coordination via Adaptive Quality Diversity and Specialization

Dixit, Gaurav; Koll, Charles; Tumer, Kagan (July 2021, Genetic and evolutionary computation)
null (Ed.)
In many real-world multiagent systems, agents must learn diverse tasks and coordinate with other agents. This paper introduces a method to allow heterogeneous agents to specialize and only learn complementary divergent behaviors needed for coordination in a shared environment. We use a hierarchical decomposition of diversity search and fitness optimization to allow agents to speciate and learn diverse temporally extended actions. Within an agent population, diversity in niches is favored. Agents within a niche compete for optimizing the higher level coordination task. Experimental results in a multiagent rover exploration task demonstrate the diversity of acquired agent behavior that promotes coordination.
more » « less
Full Text Available
Gaussian Processes as Multiagent Reward Models

Dixit, Gaurav; Zerbel, Nick; Tumer, Kagan (May 2020, AAMAS Conference proceedings)

In multiagent problems that require complex joint actions, reward shaping methods yield good behavior by incentivizing the agents’ potentially valuable actions. However, reward shaping often requires access to the functional form of the reward function and the global state of the system. In this work, we introduce the Exploratory Gaussian Reward (EGR), a new reward model that creates optimistic stepping stone rewards linking the agents potentially good actions to the desired joint action. EGR models the system reward as a Gaussian Process to leverage the inherent uncertainty in reward estimates that push agents to explore unobserved state space. In the tightly coupled rover coordination problem, we show that EGR significantly outperforms a neural network approximation baseline and is comparable to the system with access to the functional form of the global reward. Finally, we demonstrate how EGR improves performance over other reward shaping methods by forcing agents to explore and escape local optima.
more » « less
Full Text Available

Search for: All records